Perceptually-Related F0 Parameters for Automatic Classification of Phrase Final Tones

نویسنده

  • Carlos Toshinori Ishi
چکیده

Automatic labeling of prosodic features is an important topic when constructing large speech databases for speech synthesis or analysis purposes. Perceptually-related F0 parameters are proposed with the aim of automatically classifying phrase final tones. Analyses are conducted to verify how consistently subjects are able to categorize phrase final tones, and how perceptual features are related with the categories. Three types of acoustic parameters are proposed and analyzed for representing the perceptual features related to the tone categories: one related to pitch movement within the phrase final, one related to pitch reset prior to the phrase final, and one related to the length of the phrase final. A classification tree is constructed to evaluate automatic classification of phrase final tones, resulting in 79.2% accuracy for the consistently categorized samples, using the best combination among the proposed acoustic parameters. key words: phrase finals, intonation, pitch perception, automatic labeling, prosody

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Perceptually-related acoustic-prosodic features of phrase finals in spontaneous speech

With the aim of automatically categorizing phrase final tones, investigations are conducted on the relationship between acoustic-prosodic parameters and perceptual tone categories. Three types of acoustic parameters are proposed: one related to pitch movement within the phrase final, one related to pitch reset prior to the phrase final, and one related to the length of the phrase final. A class...

متن کامل

The Effects of Boundary Tones on the f0 Scaling of Lexical Tones

Many languages display a pattern in which the f0 values of tones are higher in a phrase ending in a high boundary tone than in one ending with a low boundary tone. In some cases it is only the tones closest to the boundary tone that are affected, while in others it affects all tones in the phrase. An account is suggested in this paper, according to which these effects are due to perceptually-ba...

متن کامل

Acoustic cues to prosodic boundaries in Yami: A first look

It is well known that in many Indo-European languages speakers manipulate acoustic cues to encode different prosodic phrase boundaries. However, no such attempt has been made to investigate these effects in Austronesian languages. Therefore, this paper reports on preliminary research on the prosodic structure of Yami, an endangered Austronesian language spoken on Orchid Island, Taiwan. Two acou...

متن کامل

Voice source correlates of prosodic features in american English: a pilot study

In this paper, we examine the dependencies of voice source parameters F0(fundamental frequency), Ee(maximal glottal flow change), RK(glottal symmetry/skew), LIN (value related to source spectral tilt) and H∗ 1 −H∗ 2 (difference of formant-corrected magnitudes of the first two source spectral harmonics) on prosodic features such as pitch accents, stress, and sentence type and the interdependenci...

متن کامل

Perceptually based automatic prosody labeling and prosodically enriched unit selection improve concatenative text-to-speech synthesis

Prosody is an important factor in the quality of text-tospeech (TTS) synthesis. Typically, acoustic parameters such as f0 and duration are the only variables related to prosody that are used to determine unit selection. Our study explored adding the explicit use of linguistically and perceptually motivated prosodic categories in unit selection-based TTS. One of our goals was to automate the pro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEICE Transactions

دوره 88-D  شماره 

صفحات  -

تاریخ انتشار 2005